A policy gradient method for semi-Markov decision processes with application to call admission control
نویسندگان
چکیده
Solving a semi-Markov decision process (SMDP) using value or policy iteration requires precise knowledge of the probabilistic model and suffers from the curse of dimensionality. To overcome these limitations, we present a reinforcement learning approach where one optimizes the SMDP performance criterion with respect to a family of parameterised policies. We propose an online algorithm that simultaneously estimates the gradient of the performance criterion and optimises it using stochastic approximation. We apply our algorithm to call admission control. Index Terms Stochastic processes; Semi-Markov decision process; Policy gradient; Two-time scale; Call admission control. ∗Corresponding author. S. Singh is with the Signal Processing Group, Department of Engineering, Cambridge University, CB2 1PZ Cambridge, UK. Email: [email protected] Tel: +44 1223 332 784 Fax: +44 1223 332 662 V. Tadić is with the Department of Automatic Control and Systems Engineering, The University of Sheffield, S1 3JD Sheffield, UK. Email: [email protected] Tel: +44 114 222 5198 Fax: +44 (0)114 222 5661 A. Doucet is with the Signal Processing Group, Department of Engineering, Cambridge University, CB2 1PZ Cambridge, UK. Email: [email protected] Tel: +44 1223 332 676 Fax: +44 1223 332 662 February 14, 2005 DRAFT
منابع مشابه
Call Admission Control in Wireless Ds-cdma Systems Using Reinforcement Learning
THAI) สาขาวิชาวิศวกรรมโทรคมนาคม ลายมือช่ือนักศึกษา ปการศึกษา 2549 ลายมือช่ืออาจารยที่ปรึกษา PITIPONG CHANLOHA : CALL ADMISSION CONTROL IN WIRELESS DS-CDMA SYSTEMS USING REINFORCEMENT LEARNING. THESIS ADVISOR : ASST. PROF. WIPAWEE HATTAGAM, Ph.D. 95 PP. ABSTRACT (ENGLISH) DIRECT-SEQUENTIAL CODE DIVISION MULTIPLE ACCESS (DS-CDMA)/ CALL ADMISSION CONTROL/ REINFORCEMENT LEARNING/ ACTOR-CRITIC REINFO...
متن کاملOptimal Distributed Call Admission Control for Multimedia Services in Mobile Cellular Network
| There is a growing interest in providing multi-media services in mobile/wireless communication networks. Call admission control (CAC) is a key factor in supporting successfully these services. In this paper, we propose a distributed call admission control policy which is optimal from the standpoint of service providers because it maximizes their revenue. A semi-Markov decision process is empl...
متن کاملIntegrated voice/data call admission control for wireless DS-CDMA systems
This paper addresses the call admission control problem for multiservice wireless code division multiple access (CDMA) cellular systems when the physical layer channel and receiver structure at the base station are taken into account. The call admission problem is formulated as a semi-Markov decision process with constraints on the blocking probabilities and signal-to-interference ratio (SIR). ...
متن کاملCall Admission Control for Multimedia Cellular Networks Using Neuro-dynamic Programming
We consider, in this paper, the call admission control (CAC) problem in a multimedia cellular network that handles several classes of traffic with different resource requirements. The problem is formulated as a Semi-Markov Decision Process (SMDP) problem. It is too complex to allow for an exact solution for this problem, so, we use a real-time neuro -dynamic programming (NDP) [Reinforcement Lea...
متن کاملNew Channel Assignments in Cellular Networks: A Reinforcement Learning Solution
The optimization of channel assignment in cellular networks is a very complex optimization problem and it becomes more difficult when the network handles different classes of traffic. The objective is that channel utility be maximized so as to maximize service in a stochastic caller environment. We address in this paper, the dynamic channel assignment (DCA) combined with call admission control ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- European Journal of Operational Research
دوره 178 شماره
صفحات -
تاریخ انتشار 2007